# High-Reward Strategy
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control lunar landings.
Physics Model
P
sofiascat
14
1
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.
Physics Model
P
sigalaz
20
0
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to control the safe landing of a lunar lander.
Physics Model
P
andri
16
0
Td3 Hopper V3
This is a TD3 agent model trained using the stable-baselines3 library, specifically designed for reinforcement learning tasks in the Hopper-v3 environment.
Physics Model
T
sb3
30
0
Ppo HalfCheetah V3
This is a reinforcement learning model based on the PPO algorithm, specifically designed for the HalfCheetah-v3 environment and trained using the stable-baselines3 library.
Physics Model
P
sb3
51
1
Dqn LunarLander V2
This is a DQN agent trained using the stable-baselines3 library to solve reinforcement learning tasks in the LunarLander-v2 environment.
D
araffin
54
2
Featured Recommended AI Models